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I.  INTRODUCTION 


The  standard  binomial  sampling  problem  involves  repeating  an  experiment 
m  times  and  counting  the  number  of  successes.  If  the  random  variable 
representing  the  number  of  successes  is  X,  then  when  we  assume  X  has  a 
binomial  distribution  we  are  saying 


nivl.nJv _ \  _  ml  x  m  -  x 

p(X}-p(X-x;-  x|(n;  ;k)i  P  q 

where  q  =  1  -  p  and  x  may  take  the  values  0,  1,  2 . m.  Here  P  jx  j 

stands  for  the  distribution  function  of  X  or  the  probability  that  X  takes  a 
particular  value.  From  the  observed  number  of  successes  in  m  trials,  we 
are  interested  in  obtaining  an  estimate  of  the  unknown  probability,  p,  assumed 
to  be  constant  from  trial  to  trial.  The  well  known  answer  to  this  estimation 
problem  is  to  estimate  p  by 


where  s  is  the  number  of  successes  in  m  trials.  This  estimate  has  been 
shown  to  have  the  desirable  properties  of  unbiasedness,  and  minimum  variance. 
The  binomial  sampling  problem  discussed  in  this  paper  is  different  in  that  in 
addition  to  estimating  p,  we  also  need  an  estimate  of  the  number  of  trials. 

n.  THE  PROBLEM 


During  a  search  we  assume  that  there  are  a  fixed  but  unknown  number  of 
objects,  m,  available  to  be  detected.  We  assume  that  the  probability  of 
detecting  rn  object  is  an  unknown  constant  p.  We  repeat  the  search  k  times. 
All  objects  detected  in  the  first  search  are  tagged  In  some  manner  so  that 
during  the  second  search  all  new  detections  will  be  recognized  and  tagged.  This 
is  repeated  for  all  k  searches.  We  record  after  each  search  only  the  new 
detections  made  during  that  search.  We  then  have  k  random  variables 


dj,  j  =  1,  2,...,  k 

where  dj  is  the  number  of  new  detections  made  during  the  j-th  search.  We  want 
to  estimate  p  and  m  from  the  observed  values  of  d^. 


III.  RESULTS 


The  search  problem  given  above  describes  sampling  from  a  binomial 
pi^mlation.  We  eas  ily  see  that  the  distribution  of  dj  is  given  oy 


P(d,) 


m! 
(m  ■ 


m 


1 


The  distribution  of  d2  given  the  value  of  d^  is  then  seen  to  be 


(m  -  dA)l 


*2  m  ‘  dl  '  d2 


P(d2!dl)sd'!(m-d-  d^)lP  "0 
In  general,  then,  the  distribution  of  dj  given  the  d^,  d2»...d^  _  ^  is 


(m  ~  d  j  -dn  -  d.  _  ,)!  d.  m  ~  d ,  cL •  •  ~d. 

p(dj  |  dr  d2 . dj  _  j)  =  d  |  (m  .  d  a  d,)l  p  \ 

j  1-2  J  (1) 

From  the  above,  it  follows  that  the  loint  density  of  all  the  d.,  j  =  1, . . . ,  k 


the  product  of  all  die  conditional  densities  given  above,  is 


j’ 


o, .  .  ,  *  _ _ ml _ S  k (m  -  S)  +  T  -  S 

^1'  *2 . V  dj!  d2r. .  !  (m  -  dj  -  c^. .  .tyl  p  q 


(2) 


where 


The  joint  characteristic  function  of  the  dj  is  the  expected  value 

B{e'[v.+Va+---+<SAi} 
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and  is 


dl  d2 


l  l"X  d1ld2i...dknm-d1-d2-...-dk) 


Cp"1) 


y  r  il2\  2  (  ^  k(m  -  S)  +  T  -  S 

x  (pe  )•'  ‘Ipe  J  Q 

Performing  the  summations  gives 

/^l  ^2  ,  k  -  1  ,  k  Nm  /o\ 

o(tr  t2 . ty  =  (pe  +Pqe  +•••  +PQ  e  +(!  ;  *  (3' 

Now  the  characteristic  function  of  a  binomial  random  variable  defined  by  the 
parameters  N,  P  (N  trials,  P  constant  probability  from  trial  to  trial) 

is  r  iT  ia\ 

cp(T)  =  |  Pe  1  +  Q  J  .  (4) 

If  in  (3)  we  set  all  the  t's  equal  to  zero  except  ty  we  see  that 

ilj 

cp(0,  0 . tjt  0.  0 . 0)=(p+pq+...+  i>qi'2+P‘l)'1<:  + 

.  .  .  vltl  .  .it.  i  -  1  N  m 

pqJ  +  ...+pq^  Vy  *(pqi_  e  )+U-pqJ  >)  • 


By  analogy  with  (4)  we  then  have  that  the  distribution  of  the  individual  d.  is 


By  setting  in  (3)  all  the  t,  equal  to  t  we  find 


(t,  t,  .  .  0  = 


k  -  p  it  ,  k 
p  +  pq  +. . .  +  pq  e  +  q 


.  i  1  -  qk  _  cU  +  qk  y  • 
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and  so  again  by  analogy  with  (4)  we  see  that  the  distribution  of  S  =  T  d.  is  given 
by  J 


m! 


S  m  -  S 

k  \  r  k  ) 


*  -  ^  ; 


(6) 


Pm(S)  S!  (m  -  S)! 

From  (2)  we  may  find  the  maximum  likelihood  estimate  of  p  by 
differentiating  log  P(dj,  d2»**«»djc)  with  respect  to  p  and  finding  the  value 

of  p  which  makes  this  derivative  equal  to  zero.  This  leads  to 


/>  .. _ S  _ 

p  "  k(m  -  S)  +  T  * 


(7) 


Again  starting  with  (2)  it  is  possible  to  show  that  S  is  sufficient  for  m. 
This  follows  from  the  fact  that  (2)  may  be  rewritten  as 


P(d  ... i  dj^)  P(S,  m)  P(X|.  X2 > » •  •  >  _  j) 


m! 


S  '.k>m  "  S 


S!  (m  -  S)! 


0  *  V)  (VO  •  p(xi . \  -  1> 


(8) 


where 

.  x)  =  x^..^  .  *(S  -  xr  x)l  (1  P  qk)S 

k-1 

X  q  <k  -  1)  S  +1  (j  -  k)Xj 

J  *  1 

and  P  (S,  m)  is  the  same  as  Pm(S)  defined  in  (6). 

The  first  factor  on  the  right  side  of  (8)  is  a  density  as  was  already  seen  in 
(6)  The  second  factor  is  also  a  density  as  seen  from 
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X 


g 

^P(xl . *k  -  ?  =  (~f“ k) 

1  -  q 

k-1 

_  -  Cl  (k-DS+l  (j-k)x. 

V  \  \  S!  q  i  ] 

L  L  L  xr* •  -^-r  (s"xi“* •  •  Iv7! 

X1  X2  \  -  1 


k-1 


in  which  the  sum  on  the  left  is  taken  over-all  (x^, ....  p  such  that  ^  x.  =  S  . 

j=l  J 

The  second  factor  on  the  right  in  (8)  comes  from  the  transformation 


Vi*dk-i 

S  *  d  j  +  dj*. .  .+djc 


applied  to  (2).  The  factorization  in  (8)  means  that  S  is  sufficient  for  m.  Now 
we  would  like  to  have  a  function  of  S  as  an  estimate  of  m  which  is  unbiased  and 
unique.  We  want  to  use  a  result  which  says  that  if  a  function  of  a  sufficient 
s  atistic  is  both  unbiased  and  unique,  it  is  the  best  statistic  for  estimation  in  the 
sense  of  having  minimum  variance.  We  propose  as  this  function  of  S  the  quantity 


where  P  = 


(9) 


Now  we  want  f(S)  to  be  unbiased  for  all  m.  This  means 


E i  f(S)  =  m,  for  all  m 

(10) 

m 

}  f(S)P  (S)  *  m  for  all  m. 

L  m 

(ID 

S-0 
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where  P(S)  is  defined  in  (6). 


We  want  to  prove  that  f(S)  given  in  (9)  is  unique.  St^ipose  S  ^  0.  We  take 
m  =  0  and  in  (1 1)  we  have 

0 

£«0)Po(S)  «  0 
SM) 
or 

f(0)  -  0 

Stfipose  S  =  ) .  We  take  tn  *1  and  in  (11)  we  have 

1 

£f(S)Pj(S)  -  1. 

S-0 

Now  t(0)  is  already  0  and  we  find 

KDPjO) -  i 

f(l)TRT  p*  1 

fCl)  --p-  • 

Suppose  S  »  2.  We  uke  m  ■  2  and  require 

2 

£«S)P2<S)  •  2 
S-0 

We  already  have  f(0)  ■  0,  f(l)  --p-  .  Hence  we  find 

0  .  1  ^  jj  PQ  +  f(2)  .  P^  -  2 

or 

«2)-|-  * 
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By  induction  it  follows  that 

f(S)  *  I- . 

for  suppose  (9)  is  correct  for  S  r.  ’Ve  then  have  from  (li)  for  3sr  H 
and  taking  m  s  r  ♦  1 

r 

)  ~  Ppt-jC  +  <(r  +  1)  pr+j  <*+l)  mr  +  l  or 
i^O 

(r  •*-  1>  (l  -  r)  f  l\r  ♦  1)  F^1  *  r  +  l  or 


f(r  ♦  1)  P r>‘1  «  (r  ♦  OP*' 


adil  so 


f(r  ♦  1) 


.r+1 

*T~  ‘ 


Hence  i.  e  estimate  given  in  (9)  is  unique;  since  i*  is  sufficient  and  unhinged 
it  ha?  minimum  variance.  We  rewr’te  (9)  as 

(12) 

lq 

fhis  estimate  of  m  (12).  together  with 


H5.TPT- 

m*  y  h*"  simultaneously  for  fn  and$. 


(7) 
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